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Abstract — In this paper, polar codes for the m-user multiple 
access channel (MAC) with binary inputs are constructed. It is 
shown that Arikan's polarization technique applied individually 
to each user transforms independent uses of a m-user binary 
input MAC into successive uses of extremal MACs. This trans- 
formation has a number of desirable properties: (i) the 'uniform 
sum rateQ of the original MAC is preserved, (ii) the extremal 
MACs have uniform rate regions that are not only polymatroids 
but matroids and thus (iii) their uniform sum rate can be reached 
by each user transmitting either uncoded or fixed bits; in this 
sense they are easy to communicate over. A polar code can then 
be constructed with an encoding and decoding complexity of 
0{n log n) (where n is the block length), a block error probability 
of o{exp{~n^'''^~^)), and capable of achieving the uniform sum 
rate of any binary input MAC with arbitrary many users. An 
application of this polar code construction to communicating on 
the AWGN channel is also discussed. 
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1. Introduction 

The polarization technique, introduced by Arikan in |2|, 
. transforms n independent uses of a noisy binary input channel 
into single-uses of n synthetic binary input channels. The 
key property of this transformation is that almost all of these 
synthetic channels are polarized, in the sense that they are 
either very noisy or almost noiseless (i.e., having a mutual 
information either close to or to 1). Moreover, this technique 
preserves the 'uniform mutual information' — the mutual 
information of the channel with the uniform input distribution 
— that is, the proportion of synthesized channels that are 
'almost noiseless tends to the uniform mutual information. 
.As the very noisy or almost noiseless channels are channels 
for which it is easy to code, this transformation leads to the 
following coding scheme: uncoded information bits are sent on 
the polarized channels that have uniform mutual informations 
close to 1, and on the remaining channels, bits frozen to pre- 
determined values are transmitted. 

In addition to bringing a new perspective on coding, polar 
codes can be implemented with low computational effort. 
More precisely, the encoding and decoding complexity of a 
polar code is 0{n\ogn). By definition of the uniform mutual 
information, these codes achieve the capacity of any channel 
whose capacity achieving input distribution is uniform. The 
original polar code construction was generalized in (13] for 
channels with binary input alphabets to channels with alpha- 

'in this paper all mutual informations are computed when the inputs of a 
MAC are distributed independently and uniformly. The resulting rate regions, 
sum rates, etc., are prefixed by 'uniform' to distinguish them from the capacity 
region, sum capacity, etc. 



bets of arbitrary prime cardinality, allowing polar codes to 
approach the capacity of any discrete memoryless channel. 

In this paper, we show how the polarization technique can be 
extended to a multi-user problem, namely, the multiple access 
channel (MAC). One of the interesting aspect of this extension 
is that, as opposed to the single-user setting where a single 
mutual information characterizes an achievable rate, there is 
in a MAC setting a collection of mutual informations that 
characterize an achievable rate region. Hence, the terminology 
"polarized" may need to be revised in a MAC setting, as there 
may be more than two "polarized MACs". Indeed, for a 2- 
user binary input MAC, by applying Arikan's construction to 
each user's input separately, lfT4l shows that n independent 
uses of a 2-user MAC are converted into n successive uses of 
five possible "extremal 2-user MACs". These 2-user extremal 
MACs are: (i) each user sees a pure noise channel, (ii) one 
of the user sees a pure noisy channel and the other sees a 
noiseless channel, (iii) both users see a noiseless channel, 
(iv) a pure contention channel: a channel whose uniform rate 
region is the triangle with vertices (0,0), (0,1), (1,0). Note 
that for this channel, if any of the two users communicates at 
zero rate, the other user sees a noiseless channel. Moreover 
(14] shows that the uniform sum rate of the original MAC 
is preserved during the polarization process, and that the 
polarization to the extremal MACs occurs fast enough, so as to 
ensure the construction of a polar code with vanishing block 
error probability, achieving uniform sum rate on binary inputs 
2-user MACs. 

In contrast to (14], here we investigate the polarization of 
the MAC for an arbitrary number of users. In the two user 
case, the extremal MACs are not just MACs for which each 
users sees either a noiseless or pure noise channel, as there is 
also the pure contention channel. However, the uniform rate 
region of the 2-user extremal MACs are all polyhedrons with 
integer valued constraints. So, the first interesting aspect of 
the polarization of the MAC with arbitrary many users is to 
understand what pattern do extremal MACs follow. We will 
see that the 2-user and 3-user cases can be handled in a similar 
manner, whereas a new phenomenon appears when the number 
of users reaches 4, and the extremal MACs are no longer 
in a one to one correspondence with the polyhedrons having 
integer valued constraints. To characterize the extremal MACs, 
we first show how a relationship between random variables 
defined in terms of mutual information falls precisely within 
the independence notion of the matroid theory. This connection 
is used to show that the extremal MACs are in a one to one 



correspondence with binary matroids, and are "equivalent" (in 
a sense which will be defined later) to linear deterministic 
MACs. This is then used to conclude the construction of a 
polar code ensuring reliable communication on binary input 
MACs for arbitrary values of m. 

Finally, we discuss two applications resulting from the MAC 
polar code construction with arbitrary many users described in 
this paper The first one is motivated by the idea of proposing 
a new coding scheme for the additive white Gaussian noise 
channel. By transmitting the standardized average of m binary 
inputs which are uniformly distributed (taking into account 
the power constraint), we transmit a random input which is 
approximately Gaussian distributed when m is large (using 
the central limit theorem). This is important to achieve the 
highest rate on the AWGN channel, since the Gaussian input 
distribution maximizes the mutual information for this chan- 
nel. We can then use the polar code construction for a MAC 
developed in this paper to propose a new coding scheme for 
the AWGN channel. In the second application, we construct 
polar codes achieving the uniform sum-rate of MACs with 
g-ary inputs, where g is a power of 2, using the polar code 
construction for MACs with binary inputs and a large enough 
number of users. We also show how, with this extension, the 
sum-capacity of any m-user MAC can be achieved. 

II. Polarization Process for MACs 

We consider a m-user multiple access channel with binary 
input alphabets (BMAC) and arbitrary output alphabet 3^. The 
channel is specified by the conditional probabilities 

P{y\x), for ally ey and x = {x[l], . . . , x[m]) G F^. 

Let Em ■— {I,---,™} and let . . . , X[to] be mu- 

tually independent and uniformly distributed binary random 
variables. Let X (-'^[l], • ■ .,X[m\). We denote by Y the 
output of the MAC P when the input is X. For J C Em, we 
define 

X[J] ■= {X\i] : i e J}, 
I[J]{P) := I{X[J];YX[r]), 

where J'^ denotes the complement set of J in Em, and 

I{P) : 2^" ^ M 

J ^i[JKP) (1) 

where 2^™ denotes the power set of Em and where /[0](P) = 
0. Note that 

I{P) ■.= {{Ri,...,Rm) ■■ 0<^i?. < J[J](P), VJCS,„} 

ieJ 

is included in the capacity region of the MAC P. We refer 
to I{P) as the uniform rate region and to I[Em]{P) as the 
uniform sum rate. We now consider two independent uses of 
such a MAC. We define 

Xi (Xi[l],...,Xi[m]), X2 := (X2[l], . . . , 

where Xi[i],X2[?], with i e Em, are mutually independent 
and uniformly distributed binary random variables. We denote 



by Yi and Y2 the respective outputs of independent uses of 
the MAC P when the inputs are Xi and X2- 

Xi4ri, X2^Y2. (2) 

We define two additional binary random vectors 

C7i := {Ui[ll...,Ui[m]), U2 := (C/2[l], . . . , (JsH) 

with mutually independent and uniformly distributed compo- 
nents, and we put Xi and X2 in the following one to one 
correspondence with Ui and U2'. 

Xi=Ui+U2, X2^U2, 

where the addition in the above is the modulo 2 component 
wise addition. 

Definition 1. Let P : F^" ^ 3^ be a m-user BMAC. We define 
two new m-user BMACs, p- : y'^ and P+ : F^" 

y^ X ¥rp, by 

P^{yi,y2\ui) ■■= ^ ■^P{yi\ui + u2)P{y2\u2), 

P~^{yi,y2,ui\u2) -^P{yi\ui + u2)P{y2\u2), 
for all e W^, yiey,i = l, 2. 

That is, we have now two new m-user BMACs with 
extended output alphabets: 

tJi ^ iYi,Y2), U2 ^ (Yi,Y2,Ui) (3) 

which also defines /[J](P") and I[J]{P+), VJ C Em- 

This construction is the natural extension of the construction 
for m = 1, 2 in ||2l, lfT4l . Here again, we are comparing two 
independent uses of the same channel P (cf. (|2])) with two 
successive uses of the channels P^ and P+ (cf. (|3]l). Note 
that 

nj]{p-) < nj]{p) < nJKP^), vj c Em. 

Definition 2. Let {-B„}„>i be i.i.d. uniform random variables 
valued in { — ,+}. Let the BMAC valued random process 
{Pn, n > 0} be defined by 

^0 P, 

Pn:^P^"i, Vn>l. (4) 

A. Discussion 

When m = 1, we have 2/(P) = I{p-) + /(P+), which 
implies that /(Pn) (which in this case denotes a sequence of 
scalar random variables and not of functions) is a martingale. 
This allows to show that /(Pn) tends to either or 1, and 
the extremal channels of the single-user polarization scheme 
are either pure noise or noiseless channels. Moreover, in the 
polarization of the single-user channel, the extremal channels 
are synthesized by using a genie aided decoder The genie 
helps the decoder in providing the correct values of the 
previous decisions when decoding the current channel's input. 
In the polar code construction the genie is simulated by a 



decoder which decodes the bits successively on the synthetic 
channels, and uses its previous decisions assuming they are 
correct. As the block error probability of the genie aided and 
the standalone decoder are exactly the same, it is sufficient to 
study the block error probability of the genie aided decoder. 
These facts then facilitates the design of a code: bits are frozen 
on the very noisy channels and uncoded information bits are 
sent on the almost noiseless channels, recovered then by using 
a successive decision decoder at the receiver To show that 
the block error probability of this coding scheme is small, 
i.e., that the error caused by the successive decision decoder 
does not propagate, it is shown that the convergence to the 
"good" extremal channels is fast enough. When m > 2, several 
new points need to be investigated. In particular, one needs to 
check whether /[J](P„) still has a martingale property for 
different J's. Then, if the convergence of each /[J](P„) can 
be proved, one has to examine whether the obtained limiting 
MACs are also extremal MACs, along the spirit of creating 
trivial channels to communicate over, as in the single-user 
polarization. Finally, one needs to ensure that the convergence 
of these mutual informations is taking place fast enough, so 
as to ensure a block error probability that tends to zero when 
successive decision decoding is used. 

III. Preliminary Results 

Summary: In Section UlI-AI we show that I{Pn) tends a.s. 
to a matroid rank function (cf. Definition |5j. We then see 
in Section IIII-BI that the extreme points of a uniform rate 
region with matroidal constraints can be achieved by each 
user sending uncoded or frozen bits; in particular the uniform 
sum rate can be achieved by such strategies. We then show 
in Section |IV] that for arbitrary m, I{Pn) tends not to an 
arbitrary matroid rank function but to the rank function of a 
binary matroid (cf. Definition |6]l. This is used to show that 
the convergence to the extremal MACs happens fast enough, 
which then leads to the main result of this paper. Theorem 
[8] in Section |IV] This theorem states that applying Arikan's 
polar transform separately to each user, and using a successive 
decision decoder can achieve sum rates arbitrarily close to the 
uniform sum rate of a MAC, ensure block error probability 
that decays roughly like with block length, and operate 

with computational complexity 0(?ilogn). 

A. The extremal MACs 

Lemma 1. {/[J](P„),n > 0} is a bounded super-martingale 
when J Em and a bounded martingale when J = Em- 
Proof: For any J C Em, I[J]{Pn) < rn and 
2/[J](P) = I{X,[J]X2[J];YiY2Xi[r]X2[r]) 
= I{Ui[J]U2[J];YiY2Ui[r]U2[r]) 
= IiUi[J];YiY2Ui[r]U2[r]) 

+ I{U2[J];YiY2Ui [r] U2[r]Ui[J]) 
> I{Ui[J];YiY2Ui[r]) 

+ I{U2[J];YiY2UiU2[r]) 
= I[J]{p-)+I[J]{P+). (5) 



If J = Em, the inequality above is an equality. ■ 
Note that the inequality in the above are only due to 
the bounds on the mutual informations of the P^ channel. 
Because of the equality when J = Em, our construction 
preserves the uniform sum rate. As a corollary of previous 
Lemma, we have the following result. 

Lemma 2. The process {/[J](P„), J C Em} converges a.s., 
i.e., for each J C Em, limn-^-oo I[J]{Pn) exists a.s.. 

Note that for a fixed n, {/[J](P„), J C Em} denotes the 
collection of the 2™ random variables J[J](P„), for J C Em- 
When the convergence takes place (this is an a.s. event), let 
us define 

Iao[J] ■-= lim /[J](P„) 

and loo to be the function J /oo [J] ■ 

From previous theorem, /qo [J] is a random variable valued 
in [0, 1 J|]. We will now further characterize these random 
variables. 

The following Lemma is proved in 13. 

Lemma 3. For any e > 0, there exists 6 > such that 
7(^2 ; B1B2A1) - I{A2\B2) < 5 implies 

I{A2;B2) e [0,e)U (l-e,l], 

whenever (Ai, A2, Bi, B2) are random variables valued in 
¥2 X ¥2 X B X B, with B any set, and 

PAiA2BiB2{ai,a2,bi,b2) = ^Q{bi\ai + a2)Q(62|a2), 

for any ai G F2, hi Cz B, i = 1,2, where Q is a binary input 
B-output channel. 

This Lemma is used to prove the following. 

Lemma 4. For any e > and any m-user BMAC P, there 
exists (5 > 0, such that for any J C Em, if I[J]{P^) ^ 
I[J]{P) < S, we have 

I[J]{P)- I[J\i]{P) € [0,e)U(l-e,l], Vi G J, 

where /[0](P) = 0. 

Proof: Let i G J. Note that 

I[J]iP+)-I[J]iP) 

= I{U2[J];YiY2UiU2[r]) - I{U2[J];Y2U2[r]) 

= I{U2[J];YiUi\Y2U2[r]) 

> /(?72 M ; Y, Ui [z] Ui [r] \Y2 U2 [J'] ) 

= j(c/2H;FiC/i[J^]r2C/2[J^]c/iW) - i{U2[A\Y2U2[.r]) 

= I{U2[i];YiXi[r]Y2X2[r]Ui[i]) - I(U2[i];Y2X2[r])- 

Using Lemma [3] with Ak = Uk[i\, Bk = YkXk[J% k ^ 1,2, 
and 

Q{y,a:[r]\x[i])^^ ^ P{y\x), 

xl]]e¥2.j(fj'u{i} 



we conclude that we can take S small enough, so that 

I[J]{P+)-I[J]{P) < (5implies/(C/2W;F2^2[^1) e [0,£)U 
(1 — e, 1]. Moreover, we have 

i[j]{p) = i[j\i]{p) + imi];y2X2[r]). 

■ 

Lemma 5. With probability one, Ioo[J] — I<xj[J\i] S {0, 1}, 
yj CEm,ie J, where /oo[0] 0. 

Proof: From Lemma |2] we have that /[J](P„) converges 
a.s., hence lim„_j.oo l^[>^](^n+i) ~ = a.s. More- 

over, by definition of P„, |/[J](P„+i) — /[J](P„)| is equal 
to I[J]{P+) - I[J]{Pn) w.p. half and /[J](P„) - /[J](P-) 
w.p. half. Hence, from ©, E|/[J](P„+i) - I[J]{Pn)\ > 
i(/[J](P+) - /[J](P„)). But |/[J](P„+i) - /[J](P„)| is 
bounded by m, hence lim„_i.oo IE|/[J] (P„+i ) — /[,/] (P„)| — 
and lim„_>oo I[J]{Pn) ~ ^[J]{Pn) = 0. Finally, we conclude 
using Lemma |4] ■ 
Note that Lemma |5] implies in particular that {/oo['/], J ^ 
Em} is a.s. a discrete random vector 

Definition 3. We denote by Am the support of {Ioo[J],J ^ 
Em} (when the convergence takes place, i.e., a.s.). This is a 
subset of {0, . . . , m}^ . 

We have already seen that not every element in 
{0, . . . , m}^ can belong to Am- We will now further char- 
acterize the set Am- 

Definition 4. A polymatroid is a set called a ground set, 
equipped with a function / : 2™ R (where 2™ denotes the 
power set of Em), called a rank function, which satisfies 

/(0)-o, 

f[J]<f[Kl yjCKCEm, 
nJUK]+f[JnK] < f[J]+f[K], yj,KCEm- 

A proof of the following result can be found in |15|. 

Theorem 1. For any MAC and any distribution of the inputs 
X[E], we have that p{S) = I{X[S]]Y XlS"]) is a rank 
function on E, where we denote by Y the output of the MAC 
when the input is X[E]. Hence, {E,p) is a polymatroid. 

Therefore, any realization of I{Pn) is a rank function and 
the elements of Am are the image of a polymatroid rank 
function. 

Definition 5. A matroid is a polymatroid whose rank function 
is integer valued and satisfies /(J) < |J|, VJ C Em- We 
denote by MAT^ the set of all matroids with ground state 
Em- We use the notation ra to refer to the rank function of 
a matroid B. We will sometimes identify a matroid with its 
rank function image, in which case, we consider an element 
of MAT„j as a 2™ dimensional integer valued vector. We 
also define a basis of a matroid by the collection of maximal 
subsets of Em for which /(J) = | J|. 

Using Lemma |5] and the definition of a matroid, we have 
the following result. 



Theorem 2. For every m > 1, Am Q MAT„i, i.e., I^o is a 
matroid rank function. 

We will see that the inclusion is strict for m > 4. 

B. Communicating on MACs with matroidal regions 

We have shown that, when n tends to infinity, the MACs 
that we create with the polarization construction of Section 
im are particular MACs: the mutual informations I^o [J] are 
integer valued (and satisfy the other matroid properties). A 
well-known result in matroid theory (cf. Theorem 22 of |4|) 
says that the vertices of a polymatroid given by a rank function 
/ are the vectors of the following form; 

^j(i) = /(^i)> 

xm = /(A,) - V2 < I < fc 

yk < i < m, 

for some k < m, j(2), . . . , j(m) distinct in Em and 
Ai = {j{l),j{2), . . . ,j{i)}, where the vertices strictly in the 
positive orthant are given for k ~ m. 

Therefore, we have the following corollary. 

Corollary 1. The uniform rate region defined by an element 
of Am hos vertices on the hypercube {0,1}™. In particular, 
to communicate at a rate m-tuple which is a vertex of such a 
MAC uniform rate region, each user communicates on either 
a noiseless or pure noise channel. 

C. Convergence Speed and Representation of Matroids 

Convention: for a given m, we write the collection 
{/oo[J], J ^ Em} by skipping the empty set (since /oo[0] = 
0) as follows: when m = 2, we order the sequence as 
(/oo[l],/oo[2],/oo[l,2]), and when m = 3, as [I^[l]J^[2], 
/oo[3], /oo[l,2],/oo[l,3],/oo[2,3],/oo[l,2,3]), etc. 

In this section, we show that there is a correspondence 
between the extremal MACs and the linear deterministic 
MACs, i.e., MACs whose outputs are linear forms of the 
inputs. This correspondence has been used in llT4l to establish 
that the convergence to the extremal MACs for the 2-user case 
is fast, namely o(2^" ) for any (5 < 1/2, which allows to 
conclude that the block error probability of the code described 
in fTTl is small. We hence follow the same approach as in f[4\ 
to treat the case where the number of users is arbitrary, and 
proceed here to establish this correspondence. We will see 
that while the case to = 3 is similar to the case to = 2, 
a new difficulty is encountered for m > 4. How to use this 
correspondence in order to show that the the convergence to 
the extremal MACs for the m-user case is fast is done in 
Section HV] 

Note that a property of the matroids {(0, 0, 0), (0, 1, 1), 
(1, 0, 1),(1, 1, 1), (1, 1, 2)} is that we can express any of them 
as the uniform rate region of a linear deterministic MAC: 
(1,0,1) is in particular the uniform rate region of the MAC 
whose output isY = X[l], (0, 1, 1) corresponds to Y = X[2], 
(1, 1, 1) to r = X[l]+X[2] and (1, 1, 2) to F = {X[1],X[2]). 
Indeed, this is related to the fact that any matroid with a two 



element ground state can be represented in the binary field. 
Let us introduce the definition of binary matroids. 

Definition 6. Linear matroids: let A be a /c x m matrix over 
a field. Let be the index set of the columns in A. The 
rank of J C Em is defined by the rank of the sub-matrix with 
columns indexed by J. 

Binary matroids: A matroid is binary if it is a linear matroid 
over the binary field. We denote by BMATm the set of binary 
matroids with m elements. 

1) The Case m — Z: MAT3 is given by 8 unlabeled 
matroids (16 labeled ones). Moreover, they are all binary 
representable (there are 16 labeled binary matroids). For 
example, it is clear that the deterministic MAC whose output 
is X\\\ + X[2] + X[3] has a uniform rate region given 
by (1,1,1,1,1,1,1). Similarly, all matroids for to = 3 
correspond to the rate region of a linear deterministic MAC. 
However, one can also show that any 3-user binary MAC 
with uniform rate region given by a matroid is equivalent to a 
linear deterministic MAC in the following sense. A MAC with 
output Y and uniform rate region given by (1,1,1,1,1,1,1) 
must satisfy I(X[l] + X[2] + X[i\-Y) = 1, and similarly 
for other matroids (with m = 3), where the linear forms of 
inputs which can be recovered from the output are dictated by 
the binary representation of the matroid. However, the above 
claim is not quite sufficient to show that, if {/[J](P„), J C 
Em} tends to (1, 1, 1, 1, 1, 1, 1), we have along this path that 
/((p[i,2,3]>)^>) fgj^jjg where p[i'2,3] jg jj^g channel with 

input X[l\ +^[2] +^[3] and output Y . For this, one can show 
a stronger version of the claim which says that if a MAC has 
a uniform rate region "close to" (1, 1, 1, 1, 1, 1, 1), it must be 
that I{X[l] + X[2\ + X[3];y) is close to 1. In any case, a 
similar technique as for the m — 2 case lets one show that 
the convergence to the matroids in ^3 must take place fast 
enough. 

2) The Case m = A: We have that MAT4 contains 17 
unlabeled matroids (68 labeled ones). However, there are only 
16 unlabeled binary matroids with ground state 4. Hence, there 
must be a matroid which does not have a binary representation. 
This matroid is given by (1,1,1,1,2,2,2,2,2,2,2,2,2,2) 
(one can easily check that this is not a binary matroid). It 
is denoted 1/2 4 and is called the uniform matroid of rank 2 
with 4 elements (for which any 2 elements set is a basis). Of 
course, that this matroid is not binary, does not imply that an 
hypothetic convergence to it must be slow. It means that we 
will not be able to use the technique employed for the case 
TO = 2,3. 

Luckily, one can show that there is no MAC leading to U2,i 
and the following holds. 

Lemma 6. C BMAT4 C MAT4. 

Hence, the to = 4 case can be treated in a similar manner 
as the previous cases. We conclude this section by proving the 
following result, which implies Lemma |6] 



with four users and binary inputs. 

Proof: Assume that C/2,4 is the uniform rate region of a 
MAC. We then have 

/(X[z,i];y) =0, (6) 
I{X[i,j]-YX[k,l])^2, (7) 

for all j, fc, I distinct in {1, 2, 3, 4}. 

Let j;o be in the support of Y. For x G F|, define 
¥{x\yo) = W{yo\x)/ J2ze¥i'^iyo\z)- Assume w.l.o.g. that 
Po := P(0,0,0,0|2/o) > 0. Then from ©, P(0, 0, *, *|yo) = 
for any choice of * which is not 0, and P(0, 1, *, *\yo) = 
for any choice of *, * which is not 1,1. On the other hand, 
from P(0, 1, 1, l|yo) must be equal to pq. However, we 
have form ^ that P(l, 0, *|yo) = for any choice of *, * 
(even for 1, 1 since we now have P(0, 1, 1, l|j/o) > 0)- At the 
same time, this implies that the average of P(l, 0, *, *\yo) over 
*,* is zero. This brings a contradiction, since from (|6]l, this 
average must equal to po- ■ 

Moreover, a similar argument can be used to prove a 
stronger version of Lemma |7] to show that no sequence of 
MACs can have a uniform rate region that converges to C/2,4. 

3) Arbitrary values of m: We have seen in the previous 
section that for to — 2,3,4, the extremal MACs have uniform 
rate region that are not any matroids but binary matroids. This 
fact can be used to show that for to = 2, 3, 4, {/[J](P„), J C 
Em} must tend fast enough to {Ioo[J],J ^ Em}- The 
details of this proof are provided in Section IIVI in words, 
by working with the linear deterministic representation of the 
MACs, the problem of showing that the convergence speed 
is fast in the MAC setting becomes a consequence of a 
result shown in [3| for the single-user setting. We now show 
that the correspondence between extremal MACs and linear 
deterministic MACs holds for any value of to. 

Definition 7. A matroid is BUMAC if its rank function can be 

expressed as r{J) = I{X[J];YX[J'']), J C where X[E] 
has independent and binary uniformly distributed components, 
and Y is the output of a binary input MAC with input x[E]. 
Note that the letters BU in BUMAC refer to the binary uniform 
(BU) inputs. 

Theorem 3. A matroid is BUMAC if and only if it is binary. 

The converse of this theorem is easily proved and the direct 
part, which can be found in 11], uses the following theorem. 

Theorem 4 (Tutte). A matroid is binary if and only if it has 
no minor that is U2a- 

In the following theorem, we connect extremal MACs to 
linear deterministic MACs. 

Theorem 5. Let X [E] have independent and binary uniformly 
distributed components. Let Y be the output of a MAC with 
input X[E] and for which f{J) = I{X[J]] YX[J'^\) is integer 
valued, for any J C E„i. Then, there exists a binary matrix 
A such that 



Lemma 7. U2,i cannot be the uniform rate region of any MAC 



I{AX[E]-Y) =T!inkA = f{Em). 



This theorem was originally proved using matroid theory 
notations and we refer to HI for this proof and other investi- 
gations regarding the connection between matroid theory and 
extremal MACs. We provide an alternate proof of this theorem 
in the Appendix. One can also show a stronger version of this 
theorem for MACs having a uniform rate region which is close 
to a matroid, this is provided in the Theorem |6] below, whose 
proof is also given in the Appendix. Note that Theorem [3] 
follows from Theorem |6l 

Theorem 6. Let X[E\ have independent and binary uniformly 
distributed components. For any e > 0, there exists 7(e) with 
the following properties: 

(i) 7(e) fli e ^ 0, 

(ii) Whenever Y is the output of a MAC with input X[E] 
and for which / : 2" 9 J h-> I{X[J];YX[J'']) satisfies 
maxjg2™ d{f(J), Z) < s, there exists a binary matrix A 
such that 

\I{AX[E];Y)-f{E^)\<^{e). 

Theorem [3] says that an extremal MAC must have (with 
probability one) the same uniform rate region as the one of 
a linear deterministic MAC, i.e., a MAC whose output is a 
collection of linear forms of the inputs. However, Theorem |6l 
says something stronger, namely, that from the output of an 
extremal MAC, one can recover a collection of linear forms of 
the inputs and essentially nothing else. In that sense, extremal 
MACs are equivalent to linear deterministic MACs. This 
also suggests that we could have started from the beginning 
by working with the quantities /(pi'-'l) := li^^^j Xi;Y) 
instead of I[J]{P) = I{X[J];YX[J'']) to analyze the polar- 
ization of a MAC. The second measure is the natural one to 
study a MAC, since it characterizes the rate region. However, 
we have just shown that it is sufficient to work with the 
first measure to characterize the uniform rate regions of the 
polarized MACs. Indeed, one can show that /((p[''l)„) tends 
either to or 1 and Eren §agoglu |12| has provided a direct 
argument showing that these measures fully characterize the 
uniform rate region of the extremal MACs. We use a similar 
argument for the proof of Theorem |5] given in the Appendix. 

D. Comment: Relationship between information and matroid 
theories 

The process of identifying which matroids can have a rank 
function derived from an information theoretic measure, such 
as the entropy, has been investigated in different works, cf. 
lfT6l and references therein. In particular, the problem of char- 
acterizing the entropic matroids has consequent applications 
in network information theory and network coding problems 
as described in |8|. 

Entropic matroids are defined as follows. Let i? be a finite 
set and X[E] = {Xi}i^E be a random vector with each 
component valued in a finite alphabet. Let h{I) h{X[I]). 

Theorem 7. h{-) is a rank function. Hence, {E, h) is a 
polymatroid. 



A (poly)matroid is then called entropic, if its rank function 
can expressed as the entropy of a certain random vector, 
as above. A proof of the previous theorem is available in 
li5J, [i9J. The work of Han, Fujishige, Zhang and Yeung, 
Q, ||5l, lfT6l has resulted in the complete characterization of 
entropic matroids for \E\ = 2, 3. However, the problem is 
open when \E\ > 4. Note that in our case, where we have 
been interested in characterizing BUMAC matroids instead of 
entropic matroids, we have also faced a different phenomenon 
when \E\ > 4. Other similar problems have been studied in 

Col. 

IV. Main result: polar codes for MACs 

In this section, we describe our polar code construction for 
the MAC and prove the main theorem of the paper. 

Let n — 2^ for some I E Z+ and let G„ = [11]^' denote 
the Z-th Kronecker power of the given matrix. Let U[k]" := 

iUi[kl...,U4k]) and 

X[kr = f/[fc]"G„, k e E,^. 

When X[Em]^ is transmitted over n independent uses of 
P to receive F", define for any i £ {1, . . . ,n} the channel 

P(,) : F™ ^ X F™('~^' (8) 

to be the channel whose inputs and outputs are Ui[Em] ~^ 

Y"W-^[Ern]. 

Let n > 1 and e„ > 0, classify each P(i) as either 
'polarized' or 'not polarized' according to the function 
being valued within £„ of Z or not. (We will choose an 
appropriate sequence {e„} below. For the moment note only 
that by Theorem|2] if e„ were any fixed constant, the channels 
are in the 'polarized' category except for a vanishing 
fraction of indices i.) For i for which is in the polarized 
category, set to be the integer within e„ of /(P(j-))[_E,„]. 
Theorem |6] let us conclude the existence of a x m matrix 
A, for which H{A,U,[E,n] \ Y"W~^[E„,]) < l{sn), that is 
to say the output of channel P(i) determines AiUi[E„i] with 
high probabilit}0. 

We now describe what we refer to as the polar encoder 
and decoder for the MAC. The encoder will be specified via 
the sets Bi C Em, the set of users sending data on 
These wiU be chosen as follows: If is not polarized Bi is 
empty. Otherwise, select ri linearly independent columns of 
the matrix Ai, and put k in Bi if and only if the fc'th column is 
selected. For a user k let Q[k] be the set of i for which k £ Bi. 
For each user k and i ^ Q[k] choose Ui[k] independently 
and uniformly at random, reveal all these 'frozen' choices to 
user k and also to the decoder The encoder for user k will 
transmit uncoded bits on channels included in G[k], on the 
other channels it will transmit the frozen values. 

The decoder operates by successively decoding J7i [£",„], 
U2[Em], Un[E„i]- At Stage i, having already decoded 

^Indeed, by Problem 4.7 in |6|, with probability at least 1 — 7(e,i)- 



U'^^^[E„i] (assume correctly, for the moment), it is in posses- 
sion of (y", W^^[Em]), the output of P(i). It can thus deter- 
mine AiUi[Em] with high probabiHty. Since it knows Ui[Bf], 
it can determine J^keB ^imUi[k], and as {AJfc] : k £ Bi} 
are Hnearly independent, it can determine Ui[Bi]. 

Observe that for this decoder to operate as described above, 
it needs the aid of a genie which provides it with U^^^[Em] 
at stage i of the decoding. Let Ui[Em] = MY", U'-^[Em]) 
denote the decoding function of such a decoder Observe 
now, that if we construct an unaided decoder via Ui[Em] — 
{/'^^[Sm]) using the same decoding function of the 
genie-aided decoder, the block error event for this unaided 
decoder U"[E„i] 7^ U^[E„i] is the same as the block error 
event U'"'[E„-,] 7^ [/"[£',„] of the genie aided decoder Thus, 
the block error probability of the unaided decoder Pe{n) is 
equal to the block error probability of the genie aided decoder 
and so can be upper bounded as 

Pe{n)<Y,Pe{P(€}.A,U,) 

i 

where Pe{Pi^i) , AiUi) is the probability of error in determining 
AiUi from the output of the channel Note now, that we 
have to be careful in our choice of e„: we need to take e„ small 
enough to ensure that nPe(P(i), AiUi) is small. We will see in 
the proof of Theorem 9 that channel polarization happens so 
rapidly that even with such a more stringent choice of e„ the 
fraction of non polarized channels vanishes with increasing n. 
(Indeed, for any (3 < 1/2, one can choose e„ — 2~" and still 
ensure polarization.) 

Since I[Em]{P) is preserved through the polarization pro- 
cess (cf. the equality in (|5]l), we guarantee that with 5„ 
denoting the fraction of unpolarized channels, 

Sum-Rate(n) := - V \g[k]\ > I[E,n]{P) - - 
n ^ — ' 

k=l 

Thus if £„ — ^ is chosen so that d„ — > 0, the communication 
system described above achieves the uniform sum rate of the 
underlying channel. The question as to if e„ can be chosen 
so that both (5„ — > and the block error probability decays to 
zero is answered in the affirmative by the theorem below. 

Theorem 8. For any m > 1, any binary input MAC P with 
m users, and any /3 < 1/2, there exists an integer uq and a 
sequence of codes with polar encoders and decoders described 
above such that the probability of error for a block length n 
satisfies 

Pe{n) < 2-"-" , Vn>no 



and 



lim inf Sum-Rate{n) > I[Em](P). 



As for the polar code in the single-user setting |l2], the 
encoding and decoding complexity of this code is 0{n\ogn). 

Proof of Theorem^ Fix a E {(3, 1/2), e G (0, 1/2) and 
£„ = 2^" . Let int(a;) denote the closest integer to x and 



define 



{i e{l, . . . , n} : I{P(;,))[J] e Z ± e for any J, 
3A, e F^'^'" with n = int(/(P(,)[i;™])) and 



IiA,U,[E^];Y"U[E„ 



> n 



For I e P„, we have H{A,U,[Em] \ < £„, 

and the output of channel P(i) determines AiUi[Em] with high 
probability, namely 



Therefore, 



PeiPi^),A,U^)<Sn. 
Pe{n) < ^ Pe{P^,^,A,Ui^ 

<ne„ = o(2-"'). 



(9) 



(10) 



(11) 



Hence, such a choice of e„ guarantees the first claim of the 
Theorem. We now show that such an £„ is still large enough to 
maintain most of the polarized MACs active, causing no loss 
in the sum-rate as stated in the second claim of the Theorem. 
To this end, we need the following definition and result. 

Definition 8. For a m-user BMAC P with output alphabet y 
and for S C Em, we define P^^^ to be the single-user binary 
input channel with output alphabet 3^, obtained from P by 



P{y\4Ern]) 



for all y e y, s e ¥2- Schematically, if P : X[E,n] Y, we 
have P[«l : J^^^g X, ^ Y. 

Lemma 8. Let P(^i-) be the channel defined in (O and let 
(P(i))['^l be the corresponding single-user channel (cf. Defini- 
tion Is}. We have for any e > 0, a < 1/2 and S C iJ,„ 



lim e {1, . 

(— s-oo n 



/((P(,))[^l)<l-2-"°}|=0. 

The proof of this lemma is given below. Let 

D,,[S] :={ze{l,...,n}:/((P(,))[^l)>l 
D„[S] :={ze{l,...,n}:/((P(,))[^l)>l 

From Lemma |8] we have that 
1 



> 1 



e}. 



max -\D4S]\Dn[S]\ 



0. 



This implies that 



1 



V„\Vn\-^0 



(12) 
(13) 

(14) 
(15) 



where 



{i e{l,...,n} : /(P(,))[J] e Z±e for any J, 
3A, e F^'^" with n = int(/(P(,)[i;™])) and 
IiAU,[Ern];Y''U[EmY-') > -7(e)} 



where 7(e) is as in Theorem |6] (The only difference between 
V and T) is in the 7(e) and e„ in the last line.) 
Since from Theorem |6] 

lim -\V^\ = 1, 

1^00 n 

we also have from (flST l 

lim -|2?„| = 1. 

1^00 n 

Finally, since the polarization process preserves the sum-rate, 
we conclude the proof of the Theorem. ■ 
Proof of Lemma |S} Note that 

(p[S])- ^ 
(p[S])+ ^ (p+)[S] 

where = means that the two transition probability distributions 
are the same and where < means that they are degraded in 
the sense 

Pi{v\x) ^ P2{y\x) if Piiy\x) = P2Wy)\x) 

for some function (j>. Hence, defining the Bhattacharyya pa- 
rameter of a single-user channel Q with binary input and 
output alphabet y by 

z{Q) - VoUmW). 

yey 

we have 

Z[(P-)[^1] = Z[(p[^l)-] < 2Z[P[^1] 

Z[(P+)[51] < z[(p[^i)+] = z[p[^i]2 

and the random process Ze = Z[(Pf)[^l] satisfies 

Ze+i < Z} if P,+i = 1, (16) 
Z^+i < 2Zf if Pf+i = 0. (17) 

We then conclude by using Theorem 3 of Ul, which shows 
that a random process which satisfied ( fTSl l and (fTTl i satisfies 
for any ^ < 1/2 

lim inf P(Zf < 2"^'") > P(Zoo = 0). 

Hence, we have proved that 

^lim i|{*e{l,...,2'}:/((P(,))[^l)>l-e, 

^((P(,))[^l)>2-^"}|=0. 

To conclude the proof of the lemma, we use the fact that 
for any binary input discrete memoryless channel Q, we have 
/(Q) + Z(Q) > 1, hence /(Q) < 1 - (5 implies Z{Q) > b. W 

'the conditions required in Theorem 3 of |3| are indeed weaker than what 
we have here 



V. Coding for the AWGN channel 

We can use the results of Section |IV] to construct capacity- 
achieving codes for the AWGN channel in the following way. 
Over an AWGN channel, by transmitting the standardized 
average of i.i.d. binary random variables, scaled to satisfy the 
power constraint, the receiver observes 

Y=^^^(X.-l/2) + Z, 

^ 2—1 

where Z is Gaussian distributed. We can view this channel as 
being a m-user BMAC, (Xi, . . . ^Xm) Y, and the polar 
code constructed in this paper can be used to communicate 
over this channel. From the central limit theorem, by taking 
m arbitrarily large, the input distribution of previous scheme 
is arbitrarily close to a Gaussian distribution, and hence, this 
coding scheme can achieve rates arbitrarily close to the AWGN 
capacity. To ensure that this scheme provides a 'low encoding 
and decoding complexity code' for the AWGN channel, one 
has to make further complexity considerations when assuming 
m arbitrarily large. First, the decoder must recover a ra- 
dimensional binary vector over each extremal MAC and the 
total (maximal) number of hypothesis is 2™. For this, the 
decoder can proceed with each of the m users individually 
(reducing the problem to m successive hypothesis tests), by 
using the marginalized single-user channel between one user 
and the output, which is an extremal channel in the single- 
user sense. Also, one maximal independent set of users needs 
to be identified for each extremal MAC, to know where the 
information bits should be sent. There is no need to check 
exponentially many sets for this purpose, since this is achieved 
in at most m steps, by using a greedy algorithm that checks 
the independence of a given set and increases the set by one 
element at each step (starting with the empty set). 

VI. Discussion 

We have constructed a polar code for the MAC with arbi- 
trarily many users, which preserves the properties (complexity, 
error probability decay) of the polar code constructions in 
[|2l, li4|. The polarization technique brings an interesting 
perspective on the MAC problem: by polarizing the MACs for 
each user separately, we create a collection of extremal MACs 
which are "trivial" to communicate over, both regarding how 
to handle noise (noiseless or pure noise) but also regarding 
how to handle interference (which is, modulo synchronization 
in the code, removed). We have also shown that the extremal 
MACs are in a one-to-one correspondence with the linear de- 
terministic MACs, i.e., MACs whose outputs are linear forms 
of the inputs. The polar code constructed in this paper is shown 
to achieve only a portion of the dominant face of the MAC 
region, which is however sufficient to achieve the uniform 
sum rate on any binary input MAC. There are examples of 
non-extremal MACs where the polar code described in this 
paper can achieve rates in the entire uniform rate region, for 
example, this is the case for a 2-user MAC whose output is 
Xi + X2 with probability half and (Xi, X2) with probability 



half. In general, this may not be the case. Finally, we have 
considered in this paper MACs with arbitrary many users but 
binary input alphabets for each user However, for a MAC 
with m users and g-ary input alphabets, where q — 2^, we 
can split each user into k virtual users with binary inputs and 
use the polar code construction of this paper to achieve the 
uniform sum rate. Furthermore, if an m-user g-ary input MAC 
requires a certain distribution to achieve the (true) sum rate, 
then, we can split each user into multiple virtual users with 
binary inputs, map the input vector of these to the channel 
inputs so that the uniform binary distribution on the virtual 
users induces an approximation of the required distribution 
(which will get better with increasing number of virtual users), 
and thus achieve the sum capacity of an arbitrary MAC. 

Appendix 

In this section, we prove Theorem |5] and Theorem |6l We 
first need an auxiliary lemma. 

Lemma 9. Let W be a binary MAC with 2 users. Let 
X[E2\ with i.i.d. uniform binary components and let Y be 
the output of W when X[E] is sent. If I{X[1];Y X[2]), 
I{X[2]]YX[1]) and I [X[l]X[2]]Y) have specified integer 
values, then I{X[l];Y),I{X[2]-Y) and I[X[1] + X[2]]Y) 
have specified values in {0, 1}. 

Proof: Let 

/ := [I{X[\]-YX[2]),I{X[2];YX[l]),I{X[l]X[2];Y)] 
J := [I{X[l];Y), I(X[2]:Y),I{X[l] + X[2]- Y)]. 

Note that by the polymatroid property of the mutual informa- 
tion, we have 

/ e {[0, 0, 0], [0, 1, 1], [1,0, 1], [1, 1, 1], [1, 1, 2]}. (18) 

Let y e Supp(y) and for any a; G F2 define P(a;|y) = 
W{y\x)/Y.zeV^^{v\^) (recall that W is the MAC with 
inputs X[1].X|2] and output Y). Assume w.l.o.g. that po '■— 
P(0,0|y) > 0. 

. If / = [0, 0, 0] we clearly must have J = [0, 0, 0]. 
. If J = 1, 1], we have I{X[2]-Y X[l]) = 1 and we can 
determine X [2] by observing X [1] and Y , which impUes 

P(01|t/) = 0. 

Moreover, since I[X{\\X) I[X{\\X\2\;Y) - 

I{X\2\,Y X\\\) = 0, i.e., X\\\ is independent of F, we 
must have that X]a;[2] I^l^Ill^^I^lly) is uniform, and hence, 

P(00|2/) = 1/2, P(10|2;) + P(ll|j/) = 1/2. 

Now, if ★ = 1, by a symmetric argument as before, 
we must have P(ll|y) = 1/2 and hence the input 
pairs 00 and 11 have each probability half (a similar 
situation occurs when assuming that P(x|?/) > for 
X ^ (0,0)), and we can only recover X\\\ + X\2\ from 
y, i.e., J = [0,0,1]. If instead ★ = 0, we then have 
I{X{2\-Y) = I{X[1]X[2]:Y) - I{X[1];YX[2]) = 1 



and from a realization of Y we can determine X[2], i.e., 
P(10) = 1/2 and J = [0,1,0]. 

• If / = [1, 0, 1], by symmetry with the previous case, we 
have J = [1,0,0]. 

• If [1, 1. 2], we can recover all inputs from Y, hence 
J = [1,1,1]. 

■ 

Proof of Theorem\5\ Let /[S'](iy) be assigned an integer 
for any S C £■,„. By the chain rule of the mutual information 

I{X[EmW = I{X[S]-Y) + I{X[S']-YX[S]), 

and we can determine I{X\S\]Y) for any S. Since for any 

T<ZS 

I{X[S]-Y) - I{X[T]-Y) + I{X[S - T]-YX[T]), 

we can also determine I{X[S];Y X[T]) for any S,T <Z 
with 5* n r = 0. Hence we can determine 

I{X[1],X[2];YX[S]) 

I{X[1];YX[S]X[2]) 

I{X[2];YX[S]X[1]) 

and using Lemma |9l we can determine 

I{X[1]+X[2];YX[S]) 
for any S C E„i with {1, 2} ^ 5, hence 
I{X[^ + X[j];Y) 

for any 

Assume now that we have determined I{J2t ^^['S']) 
for any T with |r| < fc and S" C £;„ - T. Let T = {1, . . . , fc} 
and let 5" C {fc + 2, . . .,m}. 

I{J2xm,X[k+l]-YX[S]) 

T 

= I{X[k + l\:YX[S]) + I{Y,X\i\;YX[S]X[k +1]), 

T 

in particular, we can determine 

I{X[k + l]-YY,X\i],X[S]) 

T 

= I[Y,X\i],X[k+l]-YX[S]) 

T 

-I{Y,X\ilYX[S]) 

T 

and 

i{Y,m,x[k+i]-Yx[s]) 

T 

i{Y,m:Yx[s]x[k+i]) 

T 

I{X[k + l]-YY,X\i],X[S]) 

T 



and using Lemma |9l we can determine 

T 

hence 

T 

for any T C Em with |T| = k + 1. Hence, inducting this 
argument, we can determine ^(X^t W' ^) ^^i" ^'^y ^ — 

Note that the values of these mutual informations must 
be consistent, for example, if + X[2];y) = 1 and 

I{X[1]+X[3];Y) = 1, we must have I{X[2]+ X[3];Y) = 1. 
Hence, the T's for which ^(X^t^I*]'^) assigned 1 must 
be in agreement with these linear relationships, which can be 
compactly expressed as I{AX[E„i];Y) ~ rank(yl) for some 
binary matrix A. Finally, one can check directly (or by using 
Theorem IS that rank(A) = I{X[Em];Y). ■ 

In order to prove the "approximative" version of Theorem 
|5] i.e.. Theorem |6l we need the following lemma which is 
a corollary of Lemma 33 in |14|. The proof of Theorem |6] 
follows then from Lemma [lO] and the proof of Theorem |5] 

Lemma 10. Let W be a binary MAC with 2 users. Let X [E2] 
with i.i.d. uniform binary components and let Y be the output 
ofW when X[E] is sent. If I{X[1];YX[2]), I{X[2];YX[1]) 
and I{X[1]X[2];Y) have specified integer values within e, 
then /(X[l];y),/(X[2];y) and 1{X\V\ +X[2];r) have 
specified values outside (7(e), 1 — 7(e)) with 7(e) 0. 
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